A parallel computational framework for ultra-large-scale sequence clustering analysis

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Hierarchical Clustering in Linearithmic Time for Ultra-Large-Scale Sequence Analysis Supplementary Data

Parallel Hierarchical Clustering in Linearithmic Time for Ultra-Large-Scale Sequence Analysis Supplementary Data Qi Mao1, Wei Zheng2, Li Wang4, Yunpeng Cai5, Volker Mai6, Yijun Sun1,2,3∗ Department of Microbiology and Immunology, Department of Computer Science and Engineering, Department of Biostatistics, The State University of New York at Buffalo, Buffalo, NY 14203, USA. The Institute for Com...

متن کامل

A partition-based algorithm for clustering large-scale software systems

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...

متن کامل

Large-scale parallel data clustering

Algorithmic enhancements are described that enable large computational reduction in mean square-error data clustering. These improvements are incorporated into a parallel data-clustering tool, P-CLUSTER, designed to execute on a network of workstations. Experiments involving the unsupervised segmentation of standard texture images were performed. For some data sets, a 96 percent reduction in co...

متن کامل

A Design Framework for Ultra-Large-Scale Autonomic Systems

The origins of ultra-large-scale (ULS) systems derive from social problems that are getting more and more complex, such as climatic monitoring, transportation, citizens protection and security. These factors imply a continuous increase of information systems that evolve towards ultra-dimension systems, requiring digital communication networks that allow for communication between people, between...

متن کامل

A Parallel Algorithm for Large-scale Multiple Sequence Alignment

Multiple sequence alignment is a central topic of extensive research in computational biology. Basically, two or more protein sequences are compared to evaluate their similarity and to identify conserved regions. This work reports a methodology for parallel processing of a multiple sequence alignment algorithm (ClustalW) in an environment of networked computers. A detailed description of the mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bioinformatics

سال: 2018

ISSN: 1367-4803,1460-2059

DOI: 10.1093/bioinformatics/bty617